# Florence-2 Fine-tuning
Florence 2 FT DocVQA
MIT
A document visual question answering (DocVQA) model fine-tuned from Florence-2-base, designed to answer questions about document images.
Image-to-Text
Transformers English

sahilnishad
4,928
0
Florence 2 VLM Doc VQA
A Visual Question Answering (VQA) model fine-tuned from microsoft/Florence-2-base-ft that interprets image content and answers questions about it.
Image-to-Text
Transformers English

prithivMLmods
69
4
Florence 2 FT Lung Cancer Detection
A lung cancer detection model fine-tuned from Florence-2-base-ft that identifies lung cancer types from lung images.
Image-to-Text
Transformers English

nirusanan
20
1
TF ID Base
MIT
TF-ID is a series of object detection models designed to extract tables and figures, along with their caption text, from academic papers.
Image-to-Text
Transformers

yifeihu
408
36
TF ID Large
MIT
TF-ID is an object detection model fine-tuned from Florence-2 and designed to extract tables and figures from academic papers.
Object Detection
Transformers

yifeihu
9,893
21